Picture for Qi Zhao

Qi Zhao

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Add code
May 29, 2026
Viaarxiv icon

VL-DPO: Vision-Language-Guided Finetuning for Preference-Aligned Autonomous Driving

Add code
May 19, 2026
Viaarxiv icon

Fill the GAP: A Granular Alignment Paradigm for Visual Reasoning in Multimodal Large Language Models

Add code
May 12, 2026
Viaarxiv icon

RealDiffusion: Physics-informed Attention for Multi-character Storybook Generation

Add code
May 12, 2026
Viaarxiv icon

CyberCane: Neuro-Symbolic RAG for Privacy-Preserving Phishing Detection with Formal Ontology Reasoning

Add code
Apr 26, 2026
Viaarxiv icon

Context Unrolling in Omni Models

Add code
Apr 23, 2026
Viaarxiv icon

NeuroSymb-MRG: Differentiable Abductive Reasoning with Active Uncertainty Minimization for Radiology Report Generation

Add code
Mar 02, 2026
Viaarxiv icon

SwiftRepertoire: Few-Shot Immune-Signature Synthesis via Dynamic Kernel Codes

Add code
Feb 01, 2026
Viaarxiv icon

SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering

Add code
Jan 06, 2026
Viaarxiv icon

Benchmarking Continuous Dynamic Multi-Objective Optimization: Survey and Generalized Test Suite

Add code
Jan 04, 2026
Viaarxiv icon